Continuous F0 Modeling for HMM Based Statistical Parametric Speech Synthesis
Similar resources
Asynchronous F0 and spectrum modeling for HMM-based speech synthesis
This paper proposes an asynchronous model structure for fundamental frequency (F0) and spectrum modeling in HMM-based parametric speech synthesis to improve the performance of F0 prediction. F0 and spectrum features are considered to be synchronous in the conventional system. Considering that the production of these two features is decided by the movement of different speech organs, an explicitly...
A hierarchical F0 modeling method for HMM-based speech synthesis
Conventional state-based F0 modeling in HMM-based speech synthesis systems is good at capturing micro-prosodic features but has difficulty characterizing long-term pitch patterns directly. This paper presents a hierarchical F0 modeling method to address this issue. In this method, different F0 models are used to model the pitch patterns for different prosodic layers (including state, phone, syl...
Residual-Based Excitation with Continuous F0 Modeling in HMM-Based Speech Synthesis
In statistical parametric speech synthesis, creaky voice can cause disturbing artifacts. The reason is that standard pitch tracking algorithms tend to erroneously measure F0 in regions of creaky voice. This pattern is learned during training of hidden Markov models (HMMs). In the synthesis phase, false voiced/unvoiced decisions caused by creaky voice result in audible quality degradation. In ...
Statistical Approaches to Excitation Modeling in HMM-Based Speech Synthesis
In our previous study, we proposed the waveform interpolation (WI) approach to model the excitation signals for hidden Markov model (HMM)-based speech synthesis. This letter presents several techniques to improve excitation modeling within the WI framework. We propose both time-domain and frequency-domain zero-padding techniques to reduce the spectral distortion inherent in the synthesized ...
Duration modeling for HMM-based speech synthesis
This paper proposes a new approach to state duration modeling for HMM-based speech synthesis. The set of state durations of each phoneme HMM is modeled by a multi-dimensional Gaussian distribution, and duration models are clustered using a decision-tree-based context clustering technique. In the synthesis stage, state durations are determined by using the state duration models. In this paper, we ...
Journal
Journal title: IEEE Transactions on Audio, Speech, and Language Processing
Year: 2011
ISSN: 1558-7916, 1558-7924
DOI: 10.1109/tasl.2010.2076805